# DATA PROCESSING - VIH/SIDA en América Latina: Datos Epidemiológicos por País 2000-2020 (Intervalos de 5 Años) [HIV/AIDS in Latin America: Epidemiological Data by Country 2000-2020 (5-Year Intervals)]

**DOI:** https://doi.org/10.7910/DVN/99WRWQ

## Pipeline
1. Raw ingestion from official sources
2. Deduplication and cleaning
3. ISO 3166 country code standardization
4. Missing value coding (NA)
5. Format conversion to TSV
6. Statistical validation
7. Packaging for Harvard Dataverse
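
The cleaning steps above (2 to 6) can be sketched in Python using only the standard library. This is a minimal illustration, not the actual processing code: the column names (`country`, `year`, `cases`), the `ISO3` lookup table, the list of missing-value markers, and the rule of keeping the first record per country and year are all assumptions made for the example. The allowed years follow from the 2000-2020 range at 5-year intervals stated in the title.

```python
import csv
import io

# Hypothetical name-to-code lookup (ISO 3166-1 alpha-3); a real table would
# cover every country and spelling variant found in the raw sources.
ISO3 = {"brasil": "BRA", "brazil": "BRA", "méxico": "MEX", "mexico": "MEX"}

# Assumed markers that raw sources might use for "no data".
MISSING = {"", "-", "n/a", "nd"}

# 2000-2020 at 5-year intervals, as stated in the dataset title.
ALLOWED_YEARS = {2000, 2005, 2010, 2015, 2020}


def clean_rows(rows):
    """Standardize country codes, code missing values as NA, drop duplicates."""
    seen = set()
    cleaned = []
    for row in rows:
        name = row["country"].strip().lower()
        code = ISO3.get(name)
        if code is None:
            raise ValueError(f"no ISO 3166 code for country {row['country']!r}")
        year = int(row["year"])
        key = (code, year)
        if key in seen:
            continue  # keep only the first record per country and year
        seen.add(key)
        raw = row["cases"].strip()
        cases = "NA" if raw.lower() in MISSING else raw
        cleaned.append({"iso3": code, "year": str(year), "cases": cases})
    return cleaned


def validate(rows):
    """Return a list of problems; an empty list means the basic checks passed."""
    problems = []
    for row in rows:
        if int(row["year"]) not in ALLOWED_YEARS:
            problems.append(f"{row['iso3']}: unexpected year {row['year']}")
        if row["cases"] != "NA" and not row["cases"].isdigit():
            problems.append(f"{row['iso3']} {row['year']}: non-numeric cases")
    return problems


def to_tsv(rows):
    """Serialize cleaned rows as tab-separated text with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf, fieldnames=["iso3", "year", "cases"],
        delimiter="\t", lineterminator="\n",
    )
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()
```

For example, passing two records for Brazil in 2000 (spelled `Brasil` and `Brazil`) and one record for México in 2005 with `-` as the case count to `clean_rows` yields one `BRA` row and one `MEX` row whose `cases` value is `NA`. The validation here only checks years and numeric format; the statistical validation in step 6 is presumably more extensive.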

## Software
- R 4.3+, Python 3.10+
